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(i) APPLICANT: 


MOORE , PAUL A. 
RUBEN, STEVEN M. 
EBNER, REINHARD 


(ii) TITLE OF INVENTION: TISSUE PLASMINOGEN ACTIVATOR- LIKE PROTEASE 


( iii) NUMBER OF SEQUENCES: 16 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: HUMAN GENOME SCIENCES, INC. 

(B) STREET: 9410 KEY WEST AVENUE 

(C) CITY: ROCKVILLE 

(D) STATE: MD 

(E) COUNTRY: USA 

(F) ZIP: 20850 


(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/084,491 

(B) FILING DATE: 27-MAY-1998 

(C) CLASSIFICATION: 

( viii ) ATTORNEY/AGENT INFORMATION: 

(A) NAME: BROOKES, ANDERS A. 

(B) REGISTRATION NUMBER: 36,373 

(C) REFERENCE/DOCKET NUMBER: PF378 

<ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (301) 309-8504 

(B) TELEFAX: (301) 309-8439 


(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 124 .. 913 


(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 124.. 184 


(ix) 


FEATURE: 


(A) NAME/KEY: mat_peptide 

(B) LOCATION: 187.. 913 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

TTACCAGAAC AGCATAACAA GGGCAGGTCT GACTGCAAGC TGGGACTGGG AGGCAGAGCC 60 

GCCGCCAAGG GGGCCTCGGT TAAACACTGG TCGTTCAATC ACCTGCAAGA C G AAGAGGC A 120 


AGG 

ATG 

Met 

-21 

CTG 

Leu 

-20 

TTG 

Leu 

GCC 

Ala 

TGG 

Trp 

GTA 

Val 

CAA 

Gin 

-15 

GCA 

Ala 

TTC 

Phe 

CTC 

Leu 

GTC 

Val 

AGC 

Ser 

-10 

AAC 

Asn 

ATG 

Met 

CTC 

Leu 

168 

CTA 

GCA 

GAA 

GCC 

TAT 

GGA 

TCT 

GGA 

GGC 

TGT 

TTC 

TGG 

GAC 

AAC 

GGC 

CAC 

216 

Leu 

Ala 

-5 

Glu 

Ala 

Tyr 

Gly 

Ser 

1 

Gly 

Gly 

Cys 

Phe 

5 

Trp 

Asp 

Asn 

Gly 

His 

10 


CTG 

TAC 

CGG 

GAG 

GAC 

CAG 

ACC 

TCC 

CCC 

GCG 

CCG 

GGC 

CTC 

CGC 

TGC 

CTC 

264 

Leu 

Tyr 

Arg 

Glu 

Asp 

15 

Gin 

Thr 

Ser 

Pro 

Ala 

20 

Pro 

Gly 

Leu 

Arg 

Cys 
2 5 

Leu 


AAC 

TGG 

CTG 

GAC 

GCG 

CAG 

AGC 

GGG 

CTG 

GCC 

TCG 

GCC 

CCC 

GTG 

TCG 

GGG 

312 

Asn 

Trp 

Leu 

Asp 

30 

Ala 

Gin 

Ser 

Gly 

Leu 

35 

Ala 

Ser 

Ala 

Pro 

Val 

40 

Ser 

Gly 


GCC 

GGC 

AAT 

CAC 

AGT 

TAC 

TGC 

CGA 

AAC 

CCG 

GAC 

GAG 

GAC 

CCG 

CGC 

GGG 

360 

Ala 

Gly 

Asn 

45 

His 

Ser 

Tyr 

Cys 

Arg 

50 

Asn 

Pro 

Asp 

Glu 

Asp 

55 

Pro 

Arg 

Gly 


CCC 

TGG 

TGC 

TAC 

GTC 

AGT 

GGC 

GAG 

GCC 

GGC 

GTC 

CCT 

GAG 

AAA 

CGG 

CCT 

408 

Pro 

Trp 

60 

Cys 

Tyr 

Val 

Ser 

Gly 

65 

Glu 

Ala 

Gly 

Val 

Pro 

70 

Glu 

Lys 

Arg 

Pro 


TGC 

GAG 

GAC 

CTG 

CGC 

TGT 

CCA 

GAG 

ACC 

ACC 

TCC 

CAG 

GCC 

CTG 

CCA 

GCC 

456 

Cys 

75 

Glu 

Asp 

Leu 

Arg 

Cys 

80 

Pro 

Glu 

Thr 

Thr 

Ser 
8 5 

Gin 

Ala 

Leu 

Pro 

Ala 

90 


TTC 

ACG 

ACA 

GAA 

ATC 

CAG 

GAA 

GCG 

TCT 

GAA 

GGG 

CCA 

GGT 

GCA 

GAT 

GAG 

504 

Phe 

Thr 

Thr 

Glu 

lie 

95 

Gin 

Glu 

Ala 

Ser 

Glu 

'100 

Gly 

Pro 

Gly 

Ala 

Asp 

105 

Glu 


GTG 

CAG 

GTG 

TTC 

GCT 

CCT 

GCC 

AAC 

GCC 

CTG 

CCC 

GCT 

CGG 

AGT 

GAG 

GCG 

552 

Val 

Gin 

Val 

Phe 

110 

Ala 

Pro 

Ala 

Asn 

Ala 

115 

Leu 

Pro 

Ala 

Arg 

Ser 

120 

Glu 

Ala 


GCA 

GCT 

GTG 

CAG- 

CCA 

GTG 

ATT 

GGG 

ATC 

AGC 

CAG 

CGG 

GTG 

CGG 

ATG 

AAC 

600 

Ala 

Ala 

Val 

125 

Gin 

Pro 

Val 

lie 

Gly 

130 

lie 

Ser 

Gin 

Arg. 

Val 

135 

Arg 

Met 

Asn 


TCC 

AAG 

GAG 

AAA 

AAG 

GAC 

CTG 

GGA 

ACT 

CTG 

GGC 

TAC 

GTG 

CTG 

GGC 

ATT 

648 

Ser 

Ly s 

140 

Glu 

Lys 

Lys 

Asp 

Leu 

145 

Gly 

Thr 

Leu 

Gly 

Tyr 

150 

Val 

Leu 

Gly 

lie 


ACC 

ATG 

ATG 

GTG 

ATC 

ATC 

ATT 

GCC 

ATC 

GGA 

GCT 

GGC 

ATC 

ATC 

TTG 

GGC 

696 

Thr 

155 

Met 

Met 

Val 

lie 

lie 

160 

lie 

Ala 

lie 

Gly 

Ala 

165 

Gly 

lie 

lie 

Leu 

Gly 

170 


TAC 

TCC 

TAC 

AAG 

AGG 

GGG 

AAG 

GAT 

TTG 

AAA 

GAA 

CAG 

CAT 

GAT 

CAG 

AAA. 

7 4 4 

Tyr 

Ser 

Tyr 

Lys 

Arg 

175 

Gly 

Lys 

Asp 

Leu 

Lys 

180. 

Glu 

Gin 

His 

Asp 

Gin 

135 

Lys 
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GTA TGT GAG AGG GAG ATG CAG CGA ATC ACT CTG CCC TTG TCT GCC TTC 

Val Cys Glu Arg Glu Met Gin Arg lie Thr Leu Pro Leu Ser Ala Phe 

190 195 200 

ACC AAC CCC ACC TGT GAG ATT GTG GAT GAG AAG ACT GTC GTG GTC CAC 840 

Thr Asn Pro Thr Cys Glu lie Val Asp Glu Lys Thr Val Val Val His 

205 210 215 

* 

ACC AGC CAG ACT CCA GTT GAC CCT CAG GAG GGC AGC ACC CCC CTT ATG 888 

Thr Ser Gin Thr Pro Val Asp Pro Gin Glu Gly Ser Thr Pro Leu Met 

220 225 230 

GGC CAG GCC GGG ACT CCT GGG • GCC T GAGCCCCCCC AGTGGGCAGG 933 

Gly Gin Ala Gly Thr Pro Gly Ala 
235 240 . 

AGCCCATGCA GACACTGGTG CAGGACAGCC CACCCTCCTA CAGCTAGGAG G AAC T AC CAC 993 

TTTGTGTTCT GGTTAAAACC CTACCACTCC CCCGCTTTTT TGGCGAATCC TAG T AAG AG T 1053 

GACAGAAGCA GGTGGCCCTG TGGGCTGAGG GTAAGGCTGG GTAGGGTCCT AACAGTGCTC 1113 

CTTGTCCATC CCTTGGAGCA GATTTTGTCT GTGGATGGAG ACAGTGGCAG CTCCCACAGT 1173 

i 

GATGCTGCTG CTAAGGGCTT CCAAACATTG CCTGCACCCC TGGAACTGAA CCAGGGATAG 1233 

ACGGGGAGCT CCCCCAGGCT CCTCTGTGCT TTACTAAGAT GGCTCAGTCT CCACTGTGGG 1293 

CTTGAGTGGC AT AC AC T G T T ATTCATGGTT AAGGTAAAGC AGGTCAAGGG ATGGCATTGA 1353 

AAAAftTATAT TTAGTTTTTA AAATATTTGG GATGGAACTC CCTACTGACC TCTGACAACT 1413 

GGAAACGAGT TTGTACTGAA GTCAGAACTT TGGGTTGGGA AT GAG AT C T A GGTTGTGGCT 1473 

GCTGGTATGC TTCAGCTTGC TGGCAATGAT GTGCCTTGAC AACCGTGGGC CAGGCCTGGG 1533 

CCCAGGGACT CTTCCTGTTT CATAAGGAAA GGAAGAATTG CACTGAGCAT TCCACTTAGG 1593 

AAGAGGATAG AGAAGGATCT GCTCCGCCTT TGGCCACAGG AGCAGAGGCA GACCTGGGAT 1653 

GCCCCAGTTT CTCTTCAGGG ATGGATAGTG ACCTGTCTTC ATTTTGCACA GGTAAGAGAG 1713 

TAGTTAGCTA ACCTATGGGA ATTATACTGT GGGGCCTTGT GAGCTGCTTC TAAGAGGCTA 1773 

ACCTGGAAAC TAAGCTCAGA GGCAAGGTAA TAAAGCACTT CAGGGCTTGC TCCCCAAGTG 1833 

GGCCTGATTT AGCAGGTGGT CTGCGGGCGT CCAGGTCAGC ACCTTCCTGT AGGGCACTGG 1893. 

GGCTAGGGTC ACAGCCCCTA ACTCATAAAG CAATCAAAGA ACCATTAGAA AGGGCTCATT 1953 

AAGCCTTTTG GACACAGGAC CCC AG AG AGG AAAAAG T GAC TTGCCCAAGG TCGTAAGCAA 2013 

GCTACTGGCA TGGCAAGAGC CCAGCTTCCT GACGGAGCGC AACATTTCTC CACTGCACTG 2073 

r — ' 

TGCTAGCAGC TCAGCAGGGC CTCTAACCTG TGATGTCACA CTCAAGAGGC CTTGGCAGCT 2133 

CCTAGCCATA GAGCTTCCTT TCCAGAACCC TTCCACTGCC CAATGTGGAG ACAGGGGTTA 2193 

GTGGGGCTTT CTATGGAGCC ATCTGCTTTG GGGACCTAGA CCTCAGGTGG TCTCTTGGTG 2253 
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TTAGTGATGC TGGAGAAGAG AATATTACTG GTTTCTACTT TTCTATAAAG GCATTTCTCT 
ATAAAAAAAA AAAAAA 


(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 


Met 

-21 

Leu 

-20 

Leu 

Ala 

Trp 

Val 

Gin 

-15 

Ala 

Phe 

Leu 

Val 

Ser 

-10 

Asn 

Met 

Leu 

Leu 

Ala 

-5 

Glu 

Ala 

Tyr 

Gly 

Ser 

1 

Gly 

Gly 

Cys 

Phe 

5 

Trp 

Asp 

Asn 

Gly 

His 

10 

Leu 

Tyr 

Arg 

Glu 

Asp 

15 

Gin 

Thr 

Ser 

Pro 

Ala 

20 

Pro 

Gly 

Leu 

Arg 

Cys 

25 

Leu 

Asn 

Trp 

Leu 

Asp 

30 

Ala 

Gin 

Ser 

Gly 

Leu 

35 

Ala 

Ser 

Ala 

Pro 

Val 

40 

Ser 

Gly 

Ala 

Gly 

Asn 

45 

His 

Ser 

Tyr 

Cys 

Arg 

50 

Asn 

Pro 

Asp 

Glu 

Asp 

55 

Pro 

Arg 

Gly 

Pro 

Trp 

60 

Cys 

Tyr 

Val 

Ser 

Gly 

65 

Glu 

Ala 

Gly 

Val 

Pro 

70 

Glu 

Lys 

Arg 

Pro 

Cys 

75 

Glu 

Asp 

Leu 

Arg 

Cys 

80 

Pro 

Glu 

Thr 

Thr 

Ser 

85 

Gin 

Ala 

Leu 

Pro 

Ala 

90 

Phe 

Thr 

Thr 

Glu 

He 

95 

Gin 

Glu 

Ala 

Ser 

Glu 

100 

Gly 

Pro 

Gly 

Ala 

Asp 

105 

Glu 

Val 

Gin 

Val 

Phe 

110 

Ala 

Pro 

Ala 

Asn 

Ala 

115 

Leu 

Pro 

Ala 

Arg 

Ser 

120 

Glu 

Ala 

Ala 

Ala 

Val 

125 

Gin 

Pro 

Val 

lie 

Gly 

130 

lie 

Ser 

Gin 

Arg 

Val 

135 

Arg 

Met 

Asn 

Ser 

Lys 

140 

Glu 

Lys 

Lys 

Asp 

Leu 

145 

Gly 

Thr 

Leu 

Gly 

Tyr 

150 

Val 

Leu 

Gly 

lie 

Thr 

155 

Met 

Met 

Val 

lie 

lie 

160 

lie 

Ala 

lie 

Gly 

Ala 

165 

Gly 

lie 

lie 

Leu 

Gly 

170 

Tyr 

Ser 

Tyr 

Lys 

Arg 

175 

Gly 

Lys 

Asp 

Leu 

Lys 

180 

Glu 

Gin 

His 

Asp 

Gin 

185 

Lys 

Val 

Cys 

Glu 

Arg 

190 

Glu 

Met 

Gin 

Arg 

lie 

195 

Thr 

Leu 

Pro 

Leu 

Ser 

200 

Ala 

Phe 

Thr 

Asn 

Pro 

205 

Thr 

Cys 

Glu 

lie 

Val 

210 

Asp 

Glu 

Lys 

Thr 

Val 

215 

Val 

Val 

His 

Thr 


2313 

2329 
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Ser Gin Thr Pro Val Asp Pro Gin Glu Gly Ser Thr Pro Leu Met Gly 
220 225 230 235 

Gin Ala Gly. Thr Pro Gly Ala 

240 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids 
.(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 


Tyr 

Val 

Phe 

Lys 

Ala 

Gly 

Lys 

Tyr 

Ser 

Ser 

Glu 

Phe 

Cys 

Ser 

Thr 

Pro 

1 




5 





10 





15 


Ala 

Cys 

Ser 

Glu 

Gly 

Asn 

Ser 

Asp 

Cys 

Tyr 

Phe 

Gly 

Asn 

Gly 

Ser 

Ala 




20 





25 





30 



Tyr 

Arg 

Gly 

Thr 

His 

Ser 

Leu 

Thr 

Glu 

Ser 

Gly 

Ala 

Ser 

Cys 

Leu 

Pro 



35 





40 





45 




Trp 

Asn 

Ser 

Met 

lie 

Leu 

lie 

Gly 

Lys 

Val 

Tyr 

Thr 

Ala 

Gin 

Asn 

Pro 


50 





55 





60 





Ser 

Ala 

Gin 

Ala 

Leu 

Gly 

Leu 

Gly 

Lys 

His 

Asn 

Tyr 

Cys 

Arg 

Asn 

Pro 

65 





70 





75 





80 

Asp 

Gly 

Asp 

Al a 

Lys 

Pro 

Trp 

Cys 

His 

Val 

Leu 

Lys 

Asn 

Arg 

Arg 

Leu 





85 





90 





95 


Thr 

Trp 

Glu 

Tyr 

Cys 

Asp 

Val 

Pro 

Ser 

Cys 

Ser 

Thr 

Cys 

Gly 

Leu 

Arg 




100 





105 





110 



Gin 

Tyr 

Ser 

Gin 

Pro 

Gin 

Phe 

Arg 

lie 

Lys 

Gly 

Gly 

Leu 

Phe 

Ala 

Asp 



115 





120 





125 




lie 

Al a 

Ser 

His 

Pro 

Trp 

Gin 

Ala 

Ala 

lie 

Phe 

Ala 

Lys 

His 

Arg 

Arg 


130 





135 





140 



• 


Ser 

Pro 

Gly 

Glu 

Arg 

Phe 

Leu 

Cys 

Gly 

Gly 

lie 

Leu 

lie 

Ser 

Ser 

Cys 

145 





150 





155 





160 

Trp 

lie 

Leu 

Ser 

Ala 

Ala 

His 

Cys 

Phe 

Gin 

Glu 

Arg 

Phe 

Pro 

Pro 

His 





165 





170 





175 


His 

Leu 

Thr 

Val 

lie 

Leu 

Gly 

Arg 

Thr 

Tyr 

Arg 

Val 

Val 

Pro 

Gly 

Glu 




180 





185 





190 



Glu 

Glu 

Gin 

Lys 

Phe 

Glu 

Val 

Glu 

Lys 

Tvr 

lie 

Val 

His 

Lys 

Glu 

Phe 



195 





200 





205 




Asp 

Asp 

Asp 

Thr 

Tyr 

Asp 

Asn 

Asp 

He 

Als. 

Leu 

Leu 

Gin 

Leu 

Lys 

Ser 


S3 



210 





215 





220 





Asp 

225 

Ser 

Ser 

Arg 

Cys 

Ala 

230 

Gin 

Glu 

Ser 

Ser 

Val 

235 

Val 

Arg 

Thr 

Val 

Cys 

240 

Leu 

Pro 

Pro 

Ala 

Asp 

245 

Leu 

Gin 

Leu 

Pro 

Asp 

250 

Trp 

Thr 

Glu 

Cys 

Glu 

255 

Leu 

Ser 

Gly 

Tyr 

Gly 

260 

Lys 

His 

Glu 

Ala 

Leu 

265 

Ser 

Pro 

Phe 

Tyr 

Ser 

270 

Glu 

Arg 

Leu 

Lys 

Glu 

275 

Ala 

His 

Val 

Arg 

Leu 

280 

Tyr 

Pro 

Ser 

Ser 

Arg 

285 

Cys 

Thr 

Ser 

Gin 

His 

290 

Leu 

Leu 

Asn 

Arg 

Thr 

295 

Val 

Thr 

Asp 

Asn 

Met 

300 

Leu 

Cys 

Ala 

Gly 

Asp 

305 

Thr 

Arg 

Ser 

Gly 

Gly 

310 

Pro 

Gin 

Ala 

Asn 

Leu 

315 

His 

Asp 

Ala 

Cys 

Gin 

320 

Gly 

Asp 

Ser 

Gly 

Gly 

325 

Pro 

Leu 

Val 

Cys 

Leu 

330 

Asn 

Asp 

Gly 

Arg 

Met 

335 

Thr 

Leu 

Val 

Gly 

lie 

He 

Ser 

Trp 

Gly 

Leu 

Gly 

Cys 

Gly 

Gin 

Lys 

Asp 

Val 




340 





345 





350 



Pro 

Gly 

Val 

355 

Tyr 

Thr 

Lys 

Val 

Thr 

360 

Asn 

Tyr 

Leu 

Asp 

Trp 

365 

lie 

Arg 

Asp 

Asn 

Met 

370 

Arg 

Pro 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS:' single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 


ATTGCACTGA 

GCATTCCACT 

TAGGAAGAGG 

ATAGAGAAGG 

ATCTGCTCCG 

CCTTTGGCCA 

60 

C AG GAG C AG A 

GGCAGACCTG 

GGATGCCCCA 

TTTCTCTTCA 

GGGATGGATA 

GTGACCTGTC 

120 

TTCATTTTGC 

ACAGGTAAGA 

GAGTAGTTAG 

CTAACCTATG 

GGAATTATAC 

TGTGGGGCCT 

180 

TGTAGCTGCT 

TCTAAGAGGC 

TAACCTGGAA 

ACTAAGCTCA 

GAGGCAAGGT 

AATAAAGCAC 

240 

TTCAGGGCTT 






250 


(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 247 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

ATAGAGAAAT GCCTTTATAG AAAAGTAGAA ACCAGTAATA ' TTCTCTTCTC CAGCATCACT 60 
AACACCAAGA GACCACCTGA GGTCTAGGTC CCCAAAGCAG ATGGCTCCAT AGAAAGCCCC 120 
ACTAACCCGT CTCCACATTG GGCAGTGGAA GGGTTCTGGA AAGGAAGCTC TATGGCTA GG 180 
AGCTGCCAAG GCCTCTTGAG TGTGACATCA C AG G T TAG AG GCCCTGCTGA GCTGCTAGCA 2 40 
CAGTGCA 247 


(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 


AATTCGGCAA 

GAGTAACAGC 

ATAACAAGGG 

TAGGTCTGAC 

TGCAGCTGGG 

ACTGGGAGGC 

60 

AGAGCACGCC 

AAGGGGGCCT 

CGGTTAAACA 

CTGGTCGTTC 

AATCACCTGC 

AAAC G AGG AG 

120 

GCAAGGATGC 

TGTTGGCCTG 

GG TACAGCAT 

TCCTGGTCAG 

CAACATGCTC 

CTAGCGTAAG 

180 

CCTATGGATC 

TGGAGGCTGT 

TTCTGGGACA 

ACGGCCACTG 

TACCCGGAGG 

ACCAGACCTT 

240 

CCCGGCCGGT 

CCTCGTGCCT 

CAACTGGCTG 

GACGCGCAGG 

GCTGCCTGGG 

CCCCCTTTTC 

300 

GGTCAAATTT 

CACAGTTTAC 

TTCGAAACCG 

GGACGGGGCC 

GTGGGGGCCC 

TGGTGGTTAG 

360 

TTTGGGGTCG 

ggttttctta 

AAAAAGGT T T 

TTGGGGCCGG 

TTTTCGGAAC 

CATTTCGGTT 

• 420 

GAATTTTTTA 

GGGAAATTTC 

AGGAGTTTTT 

TAAGGGCCAT 

T 


461 


(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 


GGCAAGTTGC 

AGAACTGGAA 

ACGAGTTTGT 

AC AGAAGT C A 

GAACTTTGGG 

T TAGGAATG A 

60 

GATCTAGGTT 

GTGGCTGCTG 

GTATGCTTCA 

TTGCTGGCAA 

TAATGTGCCT 

TGACAACCGT 

120 

GGGCCAGGCC 

TGGGACCAGG 

GACTCTTCCT 

GTTTCATAAG 

GAAAGGAAGA 

ATTGCACTGA 

180 

GCATTCCACT 

TAGGAAGAGG 

ATAGAGCAAG 

GAATCTGCTC 

CGCTTTGGCC 

ACAGGAGCAG 

240 

AGGCAGACCT 

GGGATGCCCC 

AGTTCTCTTT 

CAGGGATGGG 

ATAGTGACCT 

GTCTTACATT 

300 

TTGCACAGGT 

AAAGAGAGTT 

AGTTAGCTAA 

CCTATTGGGC 

TTTATTACTT 

GGGGCTTGTG 

360 

AGCTGCTTTT 

TAAGAGGTTA 

ACCTGGAACT 

AAAGTTCAG 



399 


(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 334 base pairs 

(B) TYPE: nucleic acid 

{ C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

TAATTCGGCA AGGGACAGGT CTGACTGCAG CTGGGACTGG GAGGCAGAGC CGTCAAGGGG 60 
GCCTCGGTTA AACACTGGTC GTTCAATCAC CTGCAACGAG AGGCAAGGAT GCTGTTGGCC 120 
TGGGTACAAG CATTCCTGTC AGCAACATGC TCCTAGCAGA AAGCCTATGG ATCTGGGAGG 180 
CTGTTTCTGG GACAACGGCC ACCTGTACCG GAGGACCAGA CCTCCCCGGC CGGGCCTTCC 240 
GTGGCCTTCA ATTGGTTTGA CGTGGCAAAG GGGCTTGTCT GGCCCTTTTG GGGGAAAATT 300 
ACAAGTTTTA ATTGTCCCGG AAAACCTGGA GAGG 334 


(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 


AATTCGGCAG 

AGGGAGA GGG 

AG AT GCA GCG 

AATCACTCTG 

CCCTTGTCTG 

CCTTCACCAA 

60 

CCCCACCTGT 

GAGATTGTGG 

ATGAGAAGAC 

TGTCGTGGTC 

C AC AC C AG C C 

AG AC T C C AG T 

120 

TGACCCTCAG 

GAGGGCAGCA 

CCCCCCTTAT 

GGGACCAGGC 

CGGGGACTCC 

TGGGGCCTGA 

180 

GCCCCCCAGT 

GGGGCAGGAG 

CCATGGCAGA 

CACTGGTGCA 

GGACAGCCAC 

CCTCCTTACA 

240 

GCTAGGGGGA 

ACTACCACTT 

TGTGTTTCTG 

GTTTAAAACC 

CTACCACTCC 

CGGATTTTTT 

300 

GGCGGATTCC 

T TAG T T AAGA 

GTACAGAAGC 

AGGTGGGCCT 

ATGGCTTGGA 

GGGTAAGGTG 

360 

GGGTAGGGTT 

CCTAAAAGTG’ 

GGTTCTTGGT 

TGCTCCTGGG 

AGGAAGATTT 

TGGTTTTGGT 

420 

GGGGACAGTG 

GCAGTTTCCA 

CAGGTTGTTG 

TGTTAAGGGG 

TTCAAAAAAT 

TG 

472 


(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 


GGGCACGAGA 

TGAACTCCAA 

GGAGAAAAAG 

GACCTGGGAA 

CTCTGGGTAT 

GACGGTCCCC 

60 

CACCCCTGCC 

CTTGTTGGGA 

T T CAT C AAG A 

GATGTCATTT 

GCTGATTGTC 

TAGGGTGTGG 

120 

CTAATGGGAC 

CTTGTGTCCT 

ATCCTTGGCA 

GGCTACGTGC 

TGGGCATTAC 

CATGATGGTG 

180 

ATCATCATTG 

CCATCGGAGC 

TGGCATCATC 

TTGGGCTACT 

CTACAAGAGG 

TCAGTAGCTT 

240 

CTCTTCTGGG 

CCCTCTTAGG 

AGGAGGGGAG 

G AAG G T AC AC 

AAAGTCAAAC 

T 

291 


(2) INFORMATION FOR SEQ ID NO: 11': 

(i) SEQUENCE CHARACTERISTICS.: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGCCGACATG TCTGGAGGCT GTTTCTGG 28 

(2) INFORMATION FOR SEQ ID NO: 12: 


(i) SEQUENCE CHARACTERISTICS: 


(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

GGCGGAAGCT TATTAGGCCC CAGGAGTCCC GGC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
GGCCGGGATC CGCCATCATG CTGTTGGCCT GGGTAC 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
GGCCGGGTAC CTTATTAGGC CCCAGGAGTC CCGGC 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GGCCGGGATC CGCCATCATG CTGTTGGCCT GGGTAC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: ' SEQ ID NO:16: 
GGCCGGGTAC CTTATTAGGC CCCAGGAGTC CCGGC 


r Application No.: 09/084.491 
NOTICE TO COMPLY WITH REQUIREMENTS FOR PATENT APPLICATIONS CONTAINING 
NUCLEOTIDE SEQUENCE AND/OR AMINO ACID SEQUENCE DISCLOSURES 

/%' -'c 0 

The nucleotide and/or amino acid sequence disclosure contained in this application does not 
omply with the requirements for such aJlisclosL^I afset forth in 37 C.F.R. 1.821 - 1.825 for the 

attention is directed to these regulatio 
18230, May 1, 1990. 




irementsof 37 C.F.R. 1.821-1.825. Applicant’s 
ished at 1114 OG 29, May 15, 1990 and at 55 FR 


' I 1 2. This application does not contain, as a separate part of the disclosure on paper copy, a “Sequence 
I— I Listing” as required by 37 C.F.R. 1.821(c). 


.821(c) 


0 l * l 

O ^ 




3. A copy of the “Sequence Listing” in computer readable form has not been submitted as required by 
37 C.F.R. 1.821(e). 

4. A copy of the “Sequence Listing” in computer readable form has been submitted. However, the 
content of the computer readable form does not comply with the requirements of 37 C.F.R. 1.822 
and/or 1.823, as indicated on the attached copy of the marked -up “Raw Sequence Listing.” 


5. The computer readable form that has been filed with this application has been found to be damaged 
and/or unreadable as indicated on the attached CRF Diskette Problem Report. A Substitute 
computer readable form must be submitted as required by 37 C.F.R. 1 .825(d). 


6. The paper copy of the “Sequence Listing” is not the same as the computer readable from of the 
“Sequence Listing” as required by 37 C.F.R. 1.821(e). 

7. Other: 


Applicant Must Provide: 



An initial or substitute computer readable form (CRF) copy of the “Sequence Listing". 


An initial o r substitute paper copy of the “Sequence Listing”, as well as an amendment directing its entry 
into the specification. 

A statement that the content of the paper and computer readable copies are the same and, where 
applicable, include no new matter, as required by 37 C.F.R. 1.821(e) or 1.821(f) or 1.821(g) or 
1.825(b) or 1.825(d). 


For questions regarding compliance to these requirements, please contact: 

For Rules Interpretation, call (703) 308-4216 
For CRF Submission Help, call (703) 308-4212 
Patentln Software Program Support (SIRA) 


Technical Assistance 703-287-0200 

To Purchase Patentln Software 703-306-2600 


PLEASE RETURN A COPY OF THIS NOTICE WITH YOUR RESPONSE 


